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BACKGROUND : 

The advent of the worldwide web has made available to 
the public in digital form a vast amount of government, 
corporate and other entity technical and non-technical of 
all types. This information, resident in huge databases 
owned or managed by government and corporate entities, is 
generally available for public or private use either as a 
free service or for a download, subscription, or other fee. 
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A strong need exists for various companies, entities 
to use this information to analyze the activity of various 
entities, collection of entities, government agencies, 
industry segments, trade association, scientific, and other 
societies, etc, (hereafter "^entity" or '"entities") . For 
example, the board of directors of company A may want to 
know (independently of internal company reports) how active 
the company has been in developing a particular product or 
technology and how active its competitor company B has been 
in the same field. At present, information available on 
the net can be searched for information that relates to the 
mission by accessing the U.S. Patent & Trademark Office 
patent database and other on-line databases containing 
technical articles, journals, products announcements, and 
even SEC filings for merger and acquisition information. 

A serious problem the user immediately encounters is 
the inability to reliably search for and find the precise 
information under consideration. Although Boolean search 
techniques are provided by the database management 
companies, such technologies produce too many documents for 
the user's consideration which often wastes user time and 
cause user to abort the effort. 
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A second problem encountered relates to the 
organization and presentation of the results, which simply 
includes presentation of the document portions themselves 
in a purported order of significance. Since the user 
desires to select the specific documents of interest, user 
must wade through many words, sentences, etc. that are not 
directly relevant to the mission concept. 

Fortunately, great strides have been taken to enable 
users to locate specific documented information of all 
types specifically, U.S. Patent Application SN 09/541,182 
filed April 3, 2000 discloses a new system and method of 
semantically processing natural language documents to build 
a Subject-Action-Object Knowledge Base (SAO KB) which is 
presented to the user in an efficient and effective manner. 
Accordingly, user can control the presentation of SAO 
folders organized either as problem folders AO' s or as 
solution folders S's. Selection of a specific AO or S will 
display all Ss or AOs (respectively) stored in association 
with the selected common AO or S. 

SUMMARY OP EXEMPLARY EMBODIMENT OF THE PRESENT INVENTION: 

It is an object of the present invention to provide a 

system or method for automatically analyzing a set of 

documents residing in one or more local and/or remote 
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databases having a relationship to a user entered criterion 
that relates to one or more entities activities or a 
segment of such activities from data in one or more SAO 
Knowledge Bases for each entity, organizing the access and 
display of such SAO Knowledge Bases such that common AOs or 
Ss are displayed for quick user understanding of potential 
activities for each entity under consideration. User 
selection of a displayed S or AO will cause the AOs or Ss 
respectively stored in association with the common selected 
S or AO. In a preferred embodiment, the displayed Ss, or 
M AOs, for each entity are prioritized or ranked and 

{.^ displayed in order. For example, the Ss displayed in 

S column form, the first S having the highest number of AOs 

n associated therewith, the next S having the second highest 

W nuinber of AOs associated therewith, etc. This display 

p quickly gives user a general idea of the subjects and 

therefore the technologies most appearing in documents 
under consideration and thus an indirect indication of the 
entity's activity or interest. 

If user wants to see the detailed problems (or 
solutions) the entity uses the subject (S) to address, user 
simply selects an S folder and the AOs associated with the 
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selected S are displayed thereunder or in visual 
association therewith. 

As mentioned above, user can organize the displayed 
results by AOs in order of having the most to the least 
number of Ss associated with common AOs, This display 
quickly indicates to user the entities activities in 
addressing a specific problem or application. 

To compare problem occurrence or technology occurrence 
between document sets of two or more companies, the present 
system can display the S-AO or AO-S folders in side-by-side 
display. 

In addition, the number of documents (e.g., U.S. 
patents and published patent applications), can be counted 
and displayed in which the gross total of SAOs appear for 
each entity and for respective Ss or AOs within each 
displayed folder. 

Also, if desired and as recommended^ the initial count 
of SAOs and corresponding S and AO folder can include only 
those in the SAO Knowledge Base found in the summary of 
invention section and/or the detailed description section 
of processed patent and patent application texts. Thus, 
results yield more reliable data and conclusions since SAOs 
from the background (related to prior art) and claims 
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(related to legal terminology) would not contribute to the 
final S-AO counts. 

DRAWINGS : 

Other and further objects and benefits of the present 
invention shall become apparent with the following detailed 
description when taken in view of the appended drawings in 
which: 

Figure 1 shows one example of a screen display of a 
system according to the principles of the present invention 
in which user can start a session of the method in a 
general purpose digital computer. 

Figure 2 shows one example of a screen display 
according to the principles of the present invention in 
which user has entered data to compare two entities' 
highest level of activities applied to specific problems. 

Figure 3 shows one example of a screen display 
according to the principles of the present invention in 
which user has entered data to compare two entities' 
solutions to the same problem. 

Figure 4 shows one example of a screen display 
according to the principles of the present invention in 
which user has entered data to compare two entities related 
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to the respective use of the most used technical solution 
(subject) . 

Figure 5 is similar to Figure 4 after the user opens 
the most used solution folder to reveal the technical 
problems (AOs) associated with the solution (S). 

Figure 6 is a representation of one example of the 
principal stages of the system and method for implementing 
the present invention. 

Figure 7 is similar to Figure 3 but includes a display 
of the total number of patent documents in each entity set, 
the number in the open file, and the number in the open 
sub-file . 

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENT: 

The following are incorporated herein by reference: 

1. System and on-line information service presently 
available at www.cobrain.com and the publicly 
available user manual therefor. 

2. The software product presently marketed by 
Invention Machine Corporation of Boston^ USA^ 
under if^s trademark ^^KNOWLEDGIST" and the 
publicly available user manual therefor. 

3. WIPO Publication 00/14651, Published March 16, 
2000. 



4. U.S. Patent Application SN 09/541,182 filed April 
3, 2000. 

5. IMC's COBRAIN® server software marketed in the 
United States and manuals therefor. 

Assume user wants to analyze the photodetector 
technology of Honeywell and Canon and compare these two 
companies mutual approaches to solve problems associated 
with photodetectors or to compare these two companies' 
applications of photodetectors. 

An exemplary method according to the principles of the 
present invention includes user developing an SAO Knowledge 
Base (KB) generated, for example, by semantically 
processing natural language document data pursuant to 
references nos. 2, 4, or 5 above. Specifically, user uses 
this general purpose computer based system to access and 
process first and second sets of source documents that 
identifies Honeywell and Canon. 

For example, in Figures 1 and 6, user conducts a 
computer search of on-line database documents (e.g. U.S. 
Patent & Trademark Office on-line searchable patent and 
published patent application database files) for the 
patents filed between April 10,1985 - April 10, 2000. User 
entered a first request 41 into his/her computer 10 that 
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includes the present software, '"photodector" as a topic, 
'Xanon" as assignee and the period of interest as a user 
search criterion or query. Relevant Canon documents are 
downloaded to storage 30, read, i,e., processed into SAO 
structures in unit 32 and stored with links to the source 
sentence and document in SAO Knowledge Base 34. Results of 
the analysis (e.g., canon. cob) can be saved in a separate 
computer file 36. These steps are repeated for every set 
of documents {e.g., for every competing company such as 
Honeywell) user is interested in. As seen below user can 
manage SAO data and display it on monitor 40 or print, 
transmit it, etc, on the user's computer peripherals, not 
shown. User can, upon a second user command 42, also 
manage (open, close folders) the displayed data and obtain 
the displayed information on monitor 40. 

User can operate the system to manage data and build 
conclusions about the technical activities of respective 
companies indicated in the analyzed documents and, if 
desired, compare one company' s documented activities 
against the other. To identify major problems addressed in 
the processed documents user, by the S-A-0 mapping buttons 
12, 14, 16, selects Problem-Solution structure, here 
folders have action-object (AO) name and filled by subjects 
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(S) . See references no* 4 for further details of the 
mapping button function. Folders are sorted according to 
numbers of solutions they contain. For comparison, sorted 
in such manner files are displayed together such as shown 
in Figure 2. 

Note the Honeywell and Canon problem (AO) folders are 
displayed side by side. The system counts the number of 
SAO' s from all processed documents and displays this 
number. Note in window 6 Canon patents relating to 
^^photodector" filed in the period have 14,105 SAOs where 
Honeywell patents have 3,104 SAOs displayed in window 8, 
The smaller number 6, 8 (6112,2458 respectively) indicates 
the number displayed on the currently displayed page. 

The system also determines the number of subjects (Ss) 
each common AO and display the AOs in rank order of most Ss 
to least Ss within the respective folder. 

Note in Figure 2 that user immediately understands 
that Honeywell applies highest activity to technical 
problems of receiving signals, receiving light, producing 
signals etc, whereas Canon applies highest activity to 
light reflection, receiving, detection, etc. 

User can also compare how these companies solve the 
same technical problem. User simply selects the same 
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problem folder for each company, e.g., ""reflect : light'' in 
Figure 3. Note Honeywell has four folders related to this 
choice, so user can open all four folders to see the 
subjects (solutions) Honeywell used to perform the function 
reflect light. User also selects the ""reflect : light'' folder 
for Canon so that all Ss thereunder are displayed. Note 
Honeywell lists 50 SAOs having reflect light or a variant 
as an AO and Canon has 4 61 SAOs having reflect light or a 
variant thereof as an AO. 

User can see from this Figure 3, the various and 
extent of solutions (Ss) each company used to generate the 
function reflect light. 

If desired, the system can simply count and manage the 
gross number of patents (documents) from which the total 
SAOs derive in each entity set and the number of patents 
associated with each of the open folders. For example, 
note in window 15 of Figure 7, total Honeywell patents is 
202 for the set, while 21 patents are the source for the 
SAOs of the 5 open folders shown. Further 4 patents and 2 
patents, respectively, are the source documents for the 
""reflect : light" and ""reflect : light beam" folders. 

To identify major technologies in two sets of 
documents, user uses Solution-Problem (S-AO) structure 
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which user controls by the S-A-0 mapping buttons. Folders 
have subject (S) name and filled by action-objects (AO) • 
Folders are preferably sorted according to numbers of AOs 
they contain -highest to lowest. For comparison, files 
sorted in such manner are displayed as shown in Figure 4. 
Note both companies have '^photodector" technical element to 
generate the greatest number of functions (AOs) . The 
others are shown in rank order of number of AOs associated 
with the respective S. Visual or numerical analysis allows 
comparison of major technologies described in the two sets 
of documents. For similar technologies, user compares 
their different applications described in the two sets of 
documents. See Figure 5. 

In Figure 5, user selected (opened) '^photodector'' 
folder for each company to see and compare the problems or 
functions (AOs) the selected S addressed in the two 
document sets. These AOs are listed alphabetically and 
user can see that Honeywell used a '"photodector" to "check: 
output power of laser" and that Canon's documents does not 
mention this use of a photodector. Also, Canon uses the 
device to check ''output of tunable filter" but Honeywell's 
documents do not. 
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User can also note that both companies use 
photodetector to ""convert light''. The system can highlight 
with color or otherwise to denote common AOs for the 
entities displayed. This color highlighting is represented 
by the dashed boxes 20. These folders can be selected or 
opened to access the source sentence and the identity of 
the source document from which the SAOs were extracted thus 
revealing more detail of the photodector system each 
company used to convert light and/or into what light was 
converted along with a link to the full document. User 
selecting the link causes access to the full document. 
See^ for example, reference nos. 1 and 2, above. 

User can also note that Canon uses a photodetector to 
'copy' signals and copy high and low frequency signal 
segments. Honeywell, however, does not employ this physical 
effect in the documents accessed. 

Thus, user can obtain many ideas and much information 
from the information displayed in accordance with the 
principles of the present invention. Although, the above 
example provides a comparison between two sets of 
documents, it will be understood that a third or more 
company's documents can be processed and displayed for a 
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three or more document set comparison^ if desired^ in 
accordance with the present invention » 

Other and further changes and improvements can be made 
to the herein disclosed exemplary embodiments without 
departing from the spirit and scope of the present 
invention. 

It should be understood, also, that since the 
processed S-A-0 extractions result from semantically 
processing, accessed natural language documents, this 
system Ins not limited to technical documents, such as 
patents, but can effectively be applied to marketing, 
financial, manufacturing, personnel, and other entity 
downloaded or electronically stored documents and subject 
matters thereof, as well. 

It should also be understood that the above mentioned 
numbers of Ss - AOs - SAOs and those shown in the figures 
are actual numbers appearing in the full text of the stated 
Honeywell and Canon patents for the stated subject matter 
and time period down loaded from the U.S. Patent & 
Trademark Office searchable patent database- These 
companies and actual numbers were used for illustration 
purposes only and do not limit the extent and scope of the 
present invention. Further, as mentioned above, the 
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present invention includes variations of the method and 
system, such as eliminating from the counts the SAOs 
appearing in the background and claims sections and 
displaying the net count of Ss - AOs - SAOs that appear in 
one or more of the title, abstract, summary of invention, 
and detailed description sections of each patent document. 

The terms display and displayed, as used herein, 
pertain to data that is displayed and that can be displayed 
on a scrollable basis. 
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